THE JOHNS HOPKINS UNIVERSITY Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation
Abstract
Human language is a combination of elemental languages/domains/styles that change across and sometimes within discourses. Language models, which play a crucial role in speech recognizers and machine translation systems, are particularly sensitive to such changes, unless some form of adaptation takes place. One approach to speech language model adaptation is self-training, in which a language model’s parameters are tuned based on automatically transcribed audio. However, transcription errors can misguide self-training, particularly in challenging settings such as conversational speech. In this work, we propose a model that considers the confusions (errors) of the ASR channel. By modeling the likely confusions in the ASR output instead of using just the 1-best, we improve self-training efficacy by obtaining a more reliable reference transcription estimate. We demonstrate improved topic-based language modeling adaptation results over both 1-best and lattice self-training using our ASR channel confusion estimates on telephone conversations.
Similar resources
Patient Safety and Healthcare Quality: The Case for Language Access
This paper describes the need for Culturally and Linguistically Appropriate Services (CLAS) for Limited English Proficient (LEP) patients, identifies how the lack of CLAS for LEP patients can compromise patient safety and healthcare quality, and discusses barriers to the provision of CLAS.
Unsupervised topic adaptation for morph-based speech recognition
Topic adaptation in automatic speech recognition (ASR) refers to the adaptation of the language model and vocabulary for improved recognition of in-domain speech data. In this work we implement unsupervised topic adaptation for morph-based ASR, to improve recognition of foreign entity names. Based on the first-pass ASR hypothesis, similar texts are selected from a collection of articles, which are used ...
Large-vocabulary audio-visual speech recognition: a summary of the Johns Hopkins Summer 2000 Workshop
We report a summary of the Johns Hopkins Summer 2000 Workshop on audio-visual automatic speech recognition (ASR) in the large-vocabulary, continuous speech domain. Two problems of audio-visual ASR were mainly addressed: Visual feature extraction and audio-visual information fusion. First, image transform and model-based visual features were considered, obtained by means of the discrete cosine t...
Tc-99m MIBI imaging in lymphomas: Comparison with Tl-201 and Ga-67 scintigraphy
Tc-99m MIBI has recently been used in the functional imaging of various tumors. This prospective study was performed to evaluate the role of Tc-99m MIBI imaging at the time of the initial staging, assessment of treatment response, follow-up studies and surveillance in Hodgkin’s and non-Hodgkin’s lymphoma. 25 patients (14 with Hodgkin’s and 11 with non-Hodgkin’s lymphoma) underwent 32 stud...